Method of making high coupling ratio DMOS electrically programmable ROM

ABSTRACT

An electrically programmable memory array of the floating gate type with a high coupling ratio is made by a DMOS process which allows the edges of the floating gates to be self-aligned with the edges of the control gates and produces improved characteristics in the form of higher gain and lower body effect. The source and drain regions are formed prior to applying the first level polysilicon by a process which leaves these regions covered with thick oxide, rather than using the polysilicon as a mask to define the gate areas. Double-diffused regions are formed on one or both sides of the channel, also beneath thick oxide, instead of using a P+ tank. The ratio of the capacitance between the floating gate and control gate to the total capacitance at the floating gate is increased and the degradation in the cell performance usually caused by the P+ tank is avoided.

BACKGROUND OF THE INVENTION

This invention relates to semiconductor memory devices and methods of manufacture, and more particularly to an electrically programmable read only memory (EPROM) of the floating gate type.

Nonvolatile memory devices using a floating gate to retain charge are made by a double level polysilicon process as set forth in U.S. Pat. No. 4,122,544 issued to David J. McElroy and U.S. Pat. No. 4,112,509 issued to Lawrence S. Wall, both assigned to Texas Instruments, or in U.S. Pat. No. 3,984,822 issued to Simko et al. Other EPROM and electrically erasable EPROM cells and processes are shown in pending applications Ser. No. 957,518, filed Nov. 2, 1978, by Kuo and Tsaur and Ser. No. 1,097, filed Jan. 5, 1979, by Guterman and Chiu, both assigned to Texas Instruments. Some of these types of devices are widely used in microcomputers, particularly in program development.

Prior application Ser. No. 75,854, filed Sept. 17, 1979, by D. C. Guterman, assigned to Texas Instruments, discloses an EPROM device and process which provides higher coupling ratios; this application is an improvement of that of said Guterman application, and employs a double-diffused structure to replace the usual P+ tank, as set forth in prior application Ser. No. 072,504, filed Sept. 4, 1979, assigned to Texas Instruments.

The stacked gate structure of these prior EPROM cells include a first level polysilicon floating gate memory element and a second level poly control gate, producing a transistor which is inherently of lower gain than a comparable standard single-level silicon gate device due to three factors. First, the channel is doped P+ to enhance programming efficiency, at the expense of reduced K' in linear operation. Second, high channel doping results in earlier saturation as indicated by the larger alpha, usually about 2.0, whereas for lightly doped channels alpha is approximately unity; the drain voltage at saturation is approximately equal to gate voltage minus threshold voltage divided by alpha. Third, the channel is controlled directly by the floating gate, whose potential is governed in turn by the applied control gate voltage and the capacitance coupling ratio from floating to control gate vs. total capacitance seen by the floating gate given to first approximation by:

    Coupling Ratio=(Cf-c/Cf-c+Cf-ch)

Where Cf-c is the capacitance between the floating gate and the control gate and Cf-ch is the capacitance between the floating gate and the channel. The floating gate in effect shields the channel from the control gate, since it is a conductor, so the only way the voltage on the control gate can influence the channel is by capacitive coupling to the floating gate. If the capacitances are equal in the above formula, the coupling ratio is 50%, so one-half the control gate voltage is coupled to the floating gate and a 5 V logic level becomes a 2.5 V level as seen by the channel. Typical coupling factors in EPROM devices now in volume production having 0.2 mil channel widths and 0.15 mil overlap on each side onto field oxide can exceed 65%. The coupling ratio is of course a function of the dielectric thickness in the two capacitors as formed by gate oxide and interlevel oxide. For those production devices, the ratio of dielectric thickness is about 1.3 interlevel to 1.0 gate because the oxide thicknesses are about 1000 and 800 A, respectively. To increase the coupling ratio, significant portions of the floating gate must extend out over field oxide and be overlapped by the second level poly control gate, requiring excess spacing for alignment. Further, as the cell size is reduced for higher cell density, as for a 128 K bit device, the channel widths become much narrower, helping the coupling ratio but causing low and poorly controlled channel width-to-length ratios.

In the above-mentioned Guterman application, a process for making an EPROM is disclosed which results in significant improvement in coupling ratio and yet no compromise in circuit density or process simplicity. In the Chiu and Lien application, the problems associated with the P+ tank ordinarily used to improve programming efficiency are addressed.

It is the principal object of this invention to provide an improved electrically programmable memory, particularly with improved coupling ratio and reduced cell size. An additional object is to provide a dense array of EPROM cells having improved characteristics, made by a more efficient method. Another object is to provide a method of making an EPROM cell which has higher gain and reduced body effect.

SUMMARY OF THE INVENTION

In accordance with an illustrative embodiment of the invention an electrically programmable memory array of the floating gate type with a high coupling ratio is made by a process which allows the edges of the floating gates to be self-aligned with the edges of the control gates and produces improved characteristics in the form of higher gain and lower body effect. The source and drain regions are formed prior to applying the first level polysilicon by a process which leaves these regions covered with thick oxide, rather than using the polysilicon as a mask to define the gate areas. Double-diffused regions are formed on one or both sides of the channel, coincident with the edge of the thick oxide, instead of using a P+ tank. The ratio of the capacitance between the floating gate and control gate to the total capacitance at the floating gate is increased and the degradation in the cell performance usually caused by the P+ tank is avoided.

BRIEF DESCRIPTION OF THE DRAWINGS

The novel features believed characteristic of the invention are set forth in the appended claims. The invention itself, however, as well as other features and advantages thereof, will be best understood by reference to the detailed description which follows, read in conjunction with the accompanying drawings, wherein:

FIG. 1 is a greatly enlarged plan view of a small portion of a semiconductor chip showing the physical layout of a part of an EPROM array made according to the invention;

FIG. 2 is an electrical schematic diagram of the ROM of FIG. 1;

FIGS. 3a-3d are elevation views in section of the cell of FIG. 1, taken along the lines a--a, b--b, c--c, and d--d, respectively; and

FIGS. 4a-4e are elevation views in section of the EPROM array of FIGS. 1 and 3a-3d, at successive stages in the manufacturing process, taken generally along the line a--a in FIG. 1.

FIGS. 5a-5c shows successive stages in the manufacture of a single sided device.

DETAILED DESCRIPTION OF SPECIFIC EMBODIMENTS

With reference to FIGS. 1, 2, and 3a-3d, an electrically programmable read only memory is illustrated which is made according to the invention. The array consists of a large number of cells 10, only four of which are shown. Each cell is an MOS transistor having a control gate 11, a source 12 and a drain 13. The gates 11 are parts of polysilicon strips 14 which are the X address lines for the array. The sources are part of an N+ diffused region 15 which is connected to a ground or Vss line 16, while the drains are part of N+ diffused regions 17 which are connected to Y output lines 18. A floating gate 19 is interposed between the control gate 11 and the channel in each cell 10. Shallow extensions 12' and 13' of the source and drain regions 12 and 13 are formed as will be explained, and P regions 20 replace the P+ tank.

The array typically contains perhaps 64 or 128 K bits on a bar less than about 1/30 inch square, depending upon the bit density. The four cells 10 shown would be on a minute part of the bar, perhaps about one mil wide. A 64 K EPROM would require 256 of the X address lines 14 and 256 of the Y lines 18, providing 65,536 bits.

A thin gate oxide layer 21 separates the floating gate 19 from the silicon surface, and another thin thermal oxide layer 22 separates the floating gate from the control gate 11 in each cell. A thick layer 23 of deposited oxide overlies the upper level of polysilicon. A thick field oxide coating 24 covers parts of the bar not occupied by the transistors or diffused interconnects (i.e., "moats"), and P+ channel stop regions 25 are formed underneath all the thick field oxide. A thinner field oxide coating 26 covers the N+ diffused regions 12, 13, 15 and 17.

In order to reduce the series resistance of the elongated conductive N+ regions 15, metal strips 16 may be connected to the regions periodically. For example, a metal-to-silicon contact 27 may be made once every eight or sixteen cells, depending upon the resistivity of the N+ regions. These metal strips are particularly important for programming where higher currents are used than for the read mode.

The cell array is programmed by injection of electrons into the floating gates 19 by application of high voltage to a selected one of the polycrystalline silicon strips 14 and one of the Y lines 18 to raise the threshold voltage of the selected one of the cells 10 to a value above that which will be turned on by a 5 V. logic level voltage on an address line 14.

Injection occurs with drain 13 and gate 11 held at high voltage, typically +15 and +25 V, respectively, and source 12 held at Vss, resulting in a large current flow in the channel, and causing electrons of high energy state to traverse the gate oxide layer 21 and charge the floating gate 19. After the programming voltage is removed, the floating gate remains charged. All other cells with low voltage on either gate or drain will not be affected; that is, cells with either the X line 14 or Y line 18 low are not programmed. The array is erased by ultraviolet light, or perhaps electrically as set forth in U.S. Pat. No. 4,122,544.

Turning now to FIGS. 4a-4e, a process for making the EPROM array of the invention will be described. The starting material is a slice of P-type monocrystalline silicon, typically four inches in diameter and twenty mils thick, cut on the <100> plane, of a resistivity of about 12 to 15 ohm-cm. As mentioned above, in the FIGURES the portion shown of the bar 30 represents only a very small part of the slice, perhaps 1 or 2 mils wide. After appropriate cleaning, the slice is oxidized by exposing to oxygen in a furnace at an elevated temperature of perhaps 1100° C. to produce an oxide layer 31 over the entire slice of a thickness of about 1000 A. Next, a layer 32 of silicon nitride of about 1000 A thickness is formed over the entire slice by exposing to an atmosphere of dicholoro silane and ammonia in a reactor. A coating of photoresist is applied to the entire top surface of the slice, then exposed to ultraviolet light through a mask which defines the desired pattern of the thick field oxide 24 and the P+ channel stop 25. The resist is developed, leaving areas where nitride is then etched away by a nitride etchant, removing the exposed part of the nitride layer 32 but leaving in place the oxide layer 31.

Using the photoresist and nitride as a mask, the slice is now subjected to an ion implant step to produce the channel stop regions 25, whereby boron atoms are introduced into unmasked regions 33 of silicon. The regions 33 do not exist in the same form in the finished device, because some of this part of the slice will have been consumed in the field oxidation procedure. Usually the slice would be subjected to a heat treatment after implant but prior to field oxide growth, as set forth in U.S. Pat. No. 4,055,444, issued to G. R. Mohan Rao, assigned to Texas Instruments.

The next step in the process is the initial formation of field oxide 24, which is done by subjecting the slices to steam or an oxidizing atmosphere at about 900° to 1000° C. for several hours. This causes a thick field oxide region or layer 24 to be grown, as seen in FIG. 4b, extending into the silicon surface as silicon is consumed during oxidization. The parts of the nitride layer 32 remaining on the slice mask oxidation. The thickness of this layer 24 is about 7000 A, about half of which is above the original surface and half below. The boron doped P+ regions 33 formed by implant will be partly consumed, but will also diffuse further into the silicon ahead of the oxidation front producing P+ field stop regions 25 much deeper than the original regions 33. Additional thickness of the layer 24 results from subsequent heat treatments.

The slice is now subjected to another photoresist operation to define the source and drain areas 12 and 13 as well as the regions 15 and 17 which are to be N+ diffused. A nitride etchant removes the parts of the nitride layer 32 now exposed by holes in the photoresist, then parts of the oxide layer 31 exposed when this nitride is removed are etched to expose bare silicon. A phosphorus diffusion produces the N+ regions 34 which will subsequently become the sources, drains, etc. Instead of diffusion, these N+ regions 34 may be formed by an arsenic ion implant, in which case the oxide layer 31 would be left in place and an anneal step used before the subsequent oxidation.

The double-diffused regions are now formed by first employing another photoresist operation to limit the width of the nitride-oxide mask 32,31 over the transistor channel areas as seen in FIG. 4c. Then, an arsenic implant at about 10¹³ to 10¹⁴ per cm² is done in the areas 35 to create N regions that will ultimately form the shallow source and drain extensions 12' and 13'. Following the arsenic implant, a boron implant at about 5×10¹² to 5×10¹³ per cm² is performed to create what will be the P-type regions 20. The boron implant is masked by the same photoresist and nitride-oxide layers 32,31 as used to mask the arsenic implant, so the two implants are self-aligned. Boron will diffuse much faster than arsenic in the subsequent higher temperature operations, so the region 20 will extend into the channel farther than the regions 12' and 13'. The boron doped region 20 adjacent the drain will assist in hot electron injection into the oxide 21 just as in a conventional P+ tank, but the region 20 is narrow enough to be punched through by the reverse bias voltage applied across the N to P junction. The regions 12', 13' and 20 will be covered by the oxide 26, to be described.

Referring to FIG. 4d, a second field oxidation step is now performed by placing the slice in steam or dry oxygen at about 1000° C. for several hours, oxidizing all of the top of the slice not covered by the remaining parts of the nitride layer 32 and producing field oxide 26 of about 5000 A thickness. During this oxidation, the areas of field oxide 24 grow thicker, to perhaps 10,000 A. The N+ regions 34 are partly consumed but also diffuse further into the silicon ahead of the oxidation front to create the heavily doped regions 12, 13, 15 and 17. This oxide 26 covers the regions 12', 13' and 20 except for the laterally diffused extensions which are coincident with the channel thin oxide.

Next the remaining nitride layer 32 is removed by an etchant which attacks nitride but not silicon oxide, then the oxide 31 is removed by a so-called "dip out" etch step. The gate oxide 21 is grown by thermal oxidation to a thickness of about 500 to 8000 A. Also, windows for polysilicon to silicon contacts, if needed, are patterned and etched at this point using photoresist; none are needed in the EPROM array itself but may be used in peripheral transistors.

As seen in FIG. 4e a layer of polycrystalline silicon is deposited over the entire slice in a reactor using standard techniques to a thickness of about 5000 A, then doped with phosphorus by an N+ diffusion or implant to make it highly conductive. This first level polysilicon layer may be partially patterned at this point by applying a layer of photoresist, exposing to ultraviolet light through a mask prepared for this purpose, developing, then etching the photoresist. That is, the edges 36 beneath the address lines 14, over the field oxide, are defined at this time by a photoresist operation. It is advantageous from a cell density standpoint, however, that the edges of the floating gate next to the sources and drains be etched at the same time as the second level poly so the two levels are self-aligned.

The upper surface of the first level polysilicon is oxidized by exposing the slice to an oxidizing atmosphere at about 1000° C. to create the thermal oxide layer 22 over the floating gates. The thickness of this layer is about 1000 A. A second level of polycrystalline silicon is next deposited over the slice, N+ diffused to make it highly conductive, then masked by photoresist and etched to leave the address lines 14 and the control gates 11. This etch step also defines the edges of the floating gate 19. When the first level polysilicon is first patterned the floating gates 19 are left as parts of wide elongated strips of poly running perpendicular to what will be the address lines 14. Then, when the second level poly is patterned, parts of the first level poly are removed at the same time, so the edges of the floating gates 19 coincide with the edges of the address lines 14. This allows the cell size to be smaller because no excess overlap is needed to assure that the control gates completely cover the floating gates.

A thermal layer is grown over the second level poly then the layer 23 deposited at low temperature, about 400° C. This layer 23 insulates the metal level from the second level polycrystalline silicon, and is referred to as multilevel oxide.

Referring to FIG. 3 the multilevel oxide layer 23 is patterned by a photoresist operation, exposing the contact areas 27 and 37 for metal-to-silicon contacts over the regions 15 and 17 in the cell array.

It will be noted that connection is made by leaving a segment of the nitride layer 32 over the contact areas 27 and 37 when the oxide 26 is grown, then making an N+ diffusion into these contact areas at a later time, such as when the poly is phosphorus-diffused.

The metal contacts and interconnections are made by depositing a thin film of aluminum over the entire top surface of the slice then patterning it by a photoresist mask and etch sequence. This leaves the metal strips 16 and 18.

In the cell shown above, the patterned first level of poly floating gate 19 extends out over the source and drain regions 12 and 13 without significant coupling to them because of the thickness of the oxide 26. The self-aligned patterning of the edges 36 allows complete overlap of the two polysilicon layers without the need for excess silicon area for alignment tolerance. The increase in coupling capacitance compared to conventional EPROMs is especially significant for short channel cells where the second level poly overlap of the source and drain may comprise a substantial portion of the cell. For comparable process rules and layouts for a given 128 K bit EPROM array having about 0.14×0.14 channel dimensions, coupling area to second poly can be increased by as much as 50% using the concepts of the invention, resulting in a 10% increase in coupling ratio. The new cell thus has several advantages.

The primary advantage of the device of the invention is the lower P type concentration in the central part of the channel; the P+ region or regions 20 do not change the threshold voltage and do not degrade performance (e.g. gain, body effect) as much as in conventional devices using a P+ tank. Further, there is great leverage in increasing the second poly to first poly overlap area, giving a much more favorable coupling ratio and consequently higher channel currents at a fixed gate and drain bias. Second, the self-aligned stacked gate eliminates the design overhead associated with the tolerances required for second poly to first poly alignment. Third, it minimizes the second poly to source-drain overlap capacitance, a major source of parasitic capacitance on control gates, lines 14, and address lines 18 speeding up the EPROM circuit. Fourth, the new cell process does not lead to an increase in second poly series resistance for short channel devices as would be the case if the self-aligned stacked gate concept was used with the conventional process. Fifth, this process offers the benefit of using the self-aligned concept with thick oxide over source and drain in the peripheral circuits, giving the ability to cross over moat with first level poly, which is advantageous especially in decoders.

The large coupling capability of the device described above has the additional important advantage of high gain characteristics. The lower P-type concentration in the central part of the channel; the P+ region 20 does not change the threshold voltage as much as in conventional devices using a P+ tank.

Instead of a double-sided structure as described and shown, a single-sided device may be preferable as seen in FIG. 5a (corresponding to FIG. 4c). Here the mask for the arsenic-boron implants is offset so it covers only one side of the original source-drain mask. The resultant structure seen in FIG. 5b (corresponding to part of FIG. 3c) has a region 20 on only one side, which is sufficient because it is functional for programming improvement on only the drain side anyway.

The double-sided structure may be made by a process which does not require an additional masking operation. Referring to FIG. 5c, a controlled lateral undercut of the nitride layer 32 using plasma etching leaves a structure which, with oxide 31 and poly etched beneath the nitride, provides the desired mask for the arsenic and boron diffusion. The layer of polysilicon is added in the sandwich between the oxide 31 and the nitride 32 to facilitate a deeper and more controlled undercut.

While this invention has been described with reference to illustrative embodiments, this description is not intended to be construed in a limiting sense. Various modifications of the illustrative embodiments, as well as other embodiments of the invention, will be apparent to persons skilled in the art upon reference to this description. It is, therefore contemplated that the appended claims will cover any such modifications or embodiments as fall within the true scope of the invention. 

What is claimed is:
 1. A method of making an electrically programmable semiconductor memory cell of the floating gate type comprising the steps of:forming a first field oxide coating on a face of a body of semiconductor material, the first field oxide surrounding transistor areas on said face, forming heavily doped source and drain regions on opposite sides of a channel region in said areas using a first mask, forming a double-diffused doped region adjacent at least one side of said channel in each of said areas using a second mask smaller than said first mask, the source and drain regions and the double-diffused regions being covered with a second field oxide at said face, applying a first layer of conductive material on said face separated from the channel region by thin gate oxide, patterning the first layer to partially define a floating gate which is much larger in area than said channel and extends out over said first field oxide by a substantial amount, applying a second layer of conductive material on said face overlying the first layer separated therefrom by a thin insulator coating, and thereafter patterning the second layer to define a control gate while at the same time patterning the first layer to define edges of the floating gate.
 2. A method according to claim 1 wherein the semiconductor body is P-type silicon, the heavily doped regions are N+, the double-diffused regions are N overlying P, and the conductive material is polycrystalline silicon.
 3. A method according to claim 2 wherein the first layer is patterned to define a first elongated strip including a plurality of said floating gates for a plurality of each cells in said face.
 4. A method according to claim 3 wherein the second layer is patterned to define a second elongated strip perpendicular to the first elongated strip, the second elongated strip forming the control gates of a plurality of cells on said face.
 5. A method of making a semiconductor device comprising the steps of:forming a first insulating coating on a face of a semiconductor body surrounding a plurality of active areas on said face, forming a plurality of heavily doped regions and double-diffused regions in said face in the active areas with a second insulating coating over the regions, applying a first layer of conductive material over said face to form electrodes located between the heavily doped regions and between the double-diffused regions in the active areas, the electrodes extending out over the first insulating coating, applying a second layer of conductive material on said face overlying the first layer, patterning the second layer to define a plurality of elongated strips.
 6. A method according to claim 5 wherein the conductive material is polycrystalline silicon and thin insulator separates the electrodes from the face and separates the second layer from the electrodes.
 7. A method according to claim 6 wherein the thin insulator is much thinner than said first and second insulating coatings. 